Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection

نویسندگان

  • Qichao Yu
  • Wei Zhang
  • Xiaolong Zhang
  • Yongli Zeng
  • Yeming Wang
  • Yanhui Wang
  • Liqin Xu
  • Xiaoyun Huang
  • Nannan Li
  • Xinlan Zhou
  • Jie Lu
  • Xiaosen Guo
  • Guibo Li
  • Yong Hou
  • Shiping Liu
  • Bo Li
چکیده

Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean ×68 depth per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource to perform population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify a weak negative selection on SVA and L1 but an approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way as single nucleotide polymorphisms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient DNA Fingerprinting Based on the Targeted Sequencing of Active Retrotransposon Insertion Sites Using a Bench-Top High-Throughput Sequencing Platform

In many crop species, DNA fingerprinting is required for the precise identification of cultivars to protect the rights of breeders. Many families of retrotransposons have multiple copies throughout the eukaryotic genome and their integrated copies are inherited genetically. Thus, their insertion polymorphisms among cultivars are useful for DNA fingerprinting. In this study, we conducted a DNA f...

متن کامل

Genome-wide LORE1 retrotransposon mutagenesis and high-throughput insertion detection in Lotus japonicus.

Use of insertion mutants facilitates functional analysis of genes, but it has been difficult to identify a suitable mutagen and to establish large populations for reverse genetics in most plant species. The main challenge is developing efficient high-throughput procedures for both mutagenesis and identification of insertion sites. To date, only floral-dip T-DNA transformation of Arabidopsis has...

متن کامل

Construction of a linkage map based on retrotransposon insertion polymorphisms in sweetpotato via high-throughput sequencing

Sweetpotato (Ipomoea batatas L.) is an outcrossing hexaploid species with a large number of chromosomes (2n = 6x = 90). Although sweetpotato is one of the world's most important crops, genetic analysis of the species has been hindered by its genetic complexity combined with the lack of a whole genome sequence. In the present study, we constructed a genetic linkage map based on retrotransposon i...

متن کامل

Human Retrotransposon Insertion Polymorphisms Are Associated with Health and Disease via Gene Regulatory Phenotypes

The human genome hosts several active families of transposable elements (TEs), including the Alu, LINE-1, and SVA retrotransposons that are mobilized via reverse transcription of RNA intermediates. We evaluated how insertion polymorphisms generated by human retrotransposon activity may be related to common health and disease phenotypes that have been previously interrogated through genome-wide ...

متن کامل

Somatic retrotransposition in human cancer revealed by whole-genome and exome sequencing.

Retrotransposons constitute a major source of genetic variation, and somatic retrotransposon insertions have been reported in cancer. Here, we applied TranspoSeq, a computational framework that identifies retrotransposon insertions from sequencing data, to whole genomes from 200 tumor/normal pairs across 11 tumor types as part of The Cancer Genome Atlas (TCGA) Pan-Cancer Project. In addition to...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2017